Skip to main content

Chat Models

OrganizationModel NameAPI Model StringContext lengthQuantization
OpenAIGPT OSS 120Bopenai/gpt-oss-120b128000MXFP4
OpenAIGPT OSS 20Bopenai/gpt-oss-20b128000MXFP4
DeepSeekDeepSeek R1 Distill Llama 70Bdeepseek-ai/deepseek-r1-distill-llama-70b65000FP16
Mistral AIMistral (7B) Instruct v0.3mistralai/Mistral-7B-Instruct-v0.332768FP16
NvidiaNemotron Orchestrator 8Bnvidia/Orchestrator-8B16384FP16
MicrosoftFara 7Bmicrosoft/Fara-7B8192FP16

Code Models

OrganizationModel NameAPI Model StringContext lengthQuantization
QwenQwen3 Coder 30B A3B InstructQwen/Qwen3-Coder-30B-A3B-Instruct131000FP16

Image Models

OrganizationModel NameAPI Model StringModel TypeDefault steps
Qwen Tongyi MAIZ Image TurboTongyi-MAI/Z-Image-TurboImage Generation9
Stability AIStable Diffusion 3.5 Largestabilityai/stable-diffusion-3.5-largeImage Generation30
QwenQwen Image EditQwen/Qwen-Image-EditImage Edit20

Audio models

OrganizationModalityModel NameAPI Model String
OpenAISpeech-to-TextWhisper Large v3openai/whisper-large-v3

OCR Models

OrganizationModel NameAPI Model StringContext length
TencentHunyuan OCR (1B)tencent/HunyuanOCR16000

Vision models

OrganizationModel NameAPI Model StringContext length
QwenQwen3-VL 8B InstructQwen/Qwen3-VL-8B-Instruct32768
QwenQwen3-VL-30B-A3B-InstructQwen/Qwen3-VL-30B-A3B-Instruct128000
QwenQwen2.5-VL 7B InstructQwen/Qwen2.5-VL-7B-Instruct32768

Embedding models

Model NameAPI Model StringModel SizeEmbedding DimensionContext Window
BGE-Large-EN-v1.5BAAI/bge-large-en-v1.5326M1024512